22 results found.
Speech
Corpus,
Language Type:
Monolingual
Languages:
Bengali Czech Dari English Hindi Lao Mandarin Chinese Mesopotamian Arabic Moroccan Arabic North Levantine Arabic Panjabi Persian Polish Pushto Russian Slovak South Levantine Arabic Spanish Standard Arabic Tamil Thai Turkish Ukrainian Urdu
Availability:
From Owner
License:
LDC
Size:
204 hours Production Status:
Existing-used
Use:
Language Identification
-
Paper title:Metric learning loss functions to reduce domain mismatch in the x-vector space for language recognition
-
Paper track:4.1 Language identification and verification, lang/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Raphaël Duroselle | 2011 NIST Language Recognition Evaluation Test Set | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Basque Belgian Dutch Croatian Czech Galician Greek Hungarian Portuguese Slovak Slovenian Spanish
Availability:
From Owner
License:
Size:
None Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:An Approach to Online Speaker Change Point Detection Using DNNs and WFSTs
-
Paper track:5.4 Speech and audio segmentation/Poster Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Lukas Mateju | COST278 database | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech Romanian Slovak Spanish Vietnamese
Availability:
Freely Available
License:
<Not Specified>
Size:
55 GByte Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Diacritics Restoration Using Neural Networks
-
Paper track:Evaluation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Jakub Náplava | Charles University, Institute of Formal and Applied Linguistics | CZ | ||
| Author 2 | Milan Straka | Charles University | None | ||
| Author 3 | Pavel Straňák | Charles University in Prague | CZ | ||
| Author 4 | Jan Hajic | Charles University in Prague | CZ | Charles University | CZ |
| Main Contact | Jakub Náplava | Charles University, Institute of Formal and Applied Linguistics | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Slovak
Availability:
Not Available
License:
N/A
Size:
25 hours Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
-
Paper title:An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic Annotation
-
Paper track:Speech
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Peter Viszlay | Technical University of Košice, Slovakia | SK | ||
| Author 2 | Jan Stas | Technical University of Kosice | SK | Technical University of Kosice | SK |
| Author 3 | Tomáš Koctúr | Technical University of Kosice | SK | ||
| Author 4 | Martin Lojka | Technical University of Kosice | SK | ||
| Author 5 | Jozef Juhár | Technical University of Kosice | SK | ||
| Main Contact | Peter Viszlay | Technical University of Košice, Slovakia | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Slovak
Availability:
Freely Available
License:
custom
Size:
1500000 tokens Production Status:
Newly created-in progress
Use:
Document Classification, Text categorisation
-
Paper title:The Slovak Categorized News Corpus
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Daniel Hládek | Technical University of Kosice | SK | ||
| Author 2 | Jan Stas | Technical University of Kosice | SK | Technical University of Kosice | SK |
| Author 3 | Jozef Juhar | Technical University of Kosice | SK | ||
| Main Contact | Daniel Hládek | Technical University of Kosice | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Slovak
Availability:
We are working hard to acquire the broadcaster agreement of using the captured multimedia content and annotations outside our laboratory
License:
ELRA
Size:
265 hours Production Status:
Newly created-finished
Use:
Speech Recognition/Understanding
-
Paper title:TUKE-BNews-SK: Slovak Broadcast News Corpus Construction and Evaluation
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Matus Pleva | Technical University of Kosice | SK |
| Author 2 | Jozef Juhar | Technical University of Kosice | SK |
| Main Contact | Matus Pleva | Technical University of Kosice | None |
Documentation:
No
Written
Tokenizer,
Language Type:
Multilingual
Languages:
Slovak
Availability:
From Owner
License:
open source
Size:
1 MByte Production Status:
Existing-used
Use:
Acquisition
-
Paper title:The Slovak Categorized News Corpus
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Daniel Hládek | Technical University of Kosice | SK | ||
| Author 2 | Jan Stas | Technical University of Kosice | SK | Technical University of Kosice | SK |
| Author 3 | Jozef Juhar | Technical University of Kosice | SK | ||
| Main Contact | Daniel Hládek | Technical University of Kosice | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Slovak
Availability:
Freely Available
License:
GPL
Size:
4 scripts OtherProduction Status:
Existing-updated
Use:
Speech Recognition/Understanding
-
Paper title:TUKE-BNews-SK: Slovak Broadcast News Corpus Construction and Evaluation
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Matus Pleva | Technical University of Kosice | SK |
| Author 2 | Jozef Juhar | Technical University of Kosice | SK |
| Main Contact | Matus Pleva | Technical University of Kosice | None |
Documentation:
No
Written
Corpus,
Language Type:
Multilingual
Languages:
English Slovak
Availability:
Freely Available
License:
CreativeCommons
Size:
1 194 084 tokens Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
-
Paper title:Evaluation Set for Slovak News Information Retrieval
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Daniel Hládek | Technical University of Kosice | SK | ||
| Author 2 | Jan Stas | Technical University of Kosice | SK | Technical University of Kosice | SK |
| Author 3 | Jozef Juhar | Technical University of Kosice | SK | ||
| Main Contact | Daniel Hládek | Technical University of Kosice | None |
Documentation:
In the archive
Written
Corpus,
Language Type:
Multilingual
Languages:
Bulgarian Croatian Hungarian Polish Romanian Slovak Slovenian
Availability:
From Data Center(s)
License:
Creative Commons
Size:
727 millions sentences Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:The MARCELL Legislative Corpus
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tamás Váradi | MARCELL Legislative Corpus | /N |
Documentation:
the present LREC submission




